New strategy for the representation and the integration of biomolecular knowledge at a cellular scale.

نویسندگان

  • Roland Barriot
  • Jérôme Poix
  • Alexis Groppi
  • Aurélien Barré
  • Nicolas Goffard
  • David Sherman
  • Isabelle Dutour
  • Antoine de Daruvar
چکیده

The combination of sequencing and post-sequencing experimental approaches produces huge collections of data that are highly heterogeneous both in structure and in semantics. We propose a new strategy for the integration of such data. This strategy uses structured sets of sequences as a unified representation of biological information and defines a probabilistic measure of similarity between the sets. Sets can be composed of sequences that are known to have a biological relationship (e.g. proteins involved in a complex or a pathway) or that share similar values for a particular attribute (e.g. expression profile). We have developed a software, BlastSets, which implements this strategy. It exploits a database where the sets derived from diverse biological information can be deposited using a standard XML format. For a given query set, BlastSets returns target sets found in the database whose similarity to the query is statistically significant. The tool allowed us to automatically identify verified relationships between correlated expression profiles and biological pathways using publicly available data for Saccharomyces cerevisiae. It was also used to retrieve the members of a complex (ribosome) based on the mining of expression profiles. These first results validate the relevance of the strategy and demonstrate the promising potential of BlastSets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neuron Mathematical Model Representation of Neural Tensor Network for RDF Knowledge Base Completion

In this paper, a state-of-the-art neuron mathematical model of neural tensor network (NTN) is proposed to RDF knowledge base completion problem. One of the difficulties with the parameter of the network is that representation of its neuron mathematical model is not possible. For this reason, a new representation of this network is suggested that solves this difficulty. In the representation, th...

متن کامل

Multi Objective Scheduling of Utility-scale Energy Storages and Demand Response Programs Portfolio for Grid Integration of Wind Power

Increasing the penetration of variable wind generation in power systems has created some new challenges in the power system operation. In such a situation, the inclusion of flexible resources which have the potential of facilitating wind power integration is necessary. Demand response (DR) programs and emerging utility-scale energy storages (ESs) are known as two powerful flexible tools that ca...

متن کامل

A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining

Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...

متن کامل

Simulation and Evaluation of Urban Development Scenarios Using Integration of Cellular Automata Model and Game Theory

Urban growth is a dynamic and evolutionary spatial and social process that relates to the changes of urban spatial units and the transformation of people’s lifestyles and consequently demographic changes. Considering the urban development process as a function of land uses interactions, population structure and the strategic behavior of the agents involved in the urban development process (the ...

متن کامل

طراحی الگوی انتخاب راهبرد ادغام عمودی در صنایع غذایی کشور در سال 1380 (شرکت‌های دارای نیروی انسانی بیش از ۳۵ نفر)

It is impossible to develop and retain the Competitive advantage and gain success without Implementing strategy. The method of strategy selection is one of the most important challenges for the strategists. Different models have been formulated to enable managers to choose the appropriate strategies and some of these models are just formulated for the selection of vertical integration strategie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Nucleic acids research

دوره 32 12  شماره 

صفحات  -

تاریخ انتشار 2004